POLYNOMIAL-TIME WORD PROBLEMS 



SAUL SCHLEIMER 

Abstract. We find polynomial-time solutions to the word prob- 
lem for free-by-cyclic groups, the word problem for automorphism 
groups of free groups, and the membership problem for the han- 
dlebody subgroup of the mapping class group. All of these re- 
sults follow from observing that automorphisms of the free group 
strongly resemble straight line programs, which are widely stud- 
ied in the theory of compressed data structures. In an effort to be 
self-contained we give a detailed exposition of the necessary results 
from computer science. 



1. Introduction 

Automorphisms of the free group are closely connected to two tech- 
niques in computer science: string matching and compression. The 
relevance of the first is obvious. The second is less clear. So, consider 
the fact that an automorphism of complexity n can produce, by act- 
ing on a generator, words of size at most exp(n). Now, there are only 
exp(n) such automorphisms while there are exp(exp(n)) words avail- 
able as output. Thus most words in the free group cannotbe obtained 
in this way. Those which can are highly regular and thus susceptible 
to compression. 

Compression techniques have already made an appearance in algo- 
rithmic topology. Word equations play a starring role in the work 
of Schaefer, Sedgwick, and Stefankovic [23]. One of the problems they 
consider, connectedness of normal curves and surfaces, is also addressed 
by the orbit-counting techniques of Agol, Hass, and Thurston [1]. Both 
results rely, directly or indirectly, on Plandowski's Algorithm [22] (The- 
orem 8.1 below). 

The structure of the paper is as follows: Section 2 reviews straight 
line programs and also a slight variant, composition systems. Such 
programs are called compressed words. Section 3 is an exposition of 
Lohrey's Theorem [16]: 
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Theorem 3.5. The word problem for compressed words in the free 
group is solvable in polynomial time. 

The same techniques solve several open problems: we give polynomial- 
time algorithms for the word problem in free-by- cyclic groups (Sec- 
tion 4), the word problem in Aut(F m ) (Section 5), and various mem- 
bership problems (Section 6). In each case the compression technique 
"accelerates" the obvious exponential-time algorithm. 

Sections 7 and 8 give expositions of theorems due to Hagenah [10] and 
Plandowski [22], upon which Lohrey's Theorem relies. As a result the 
computer science portions of the paper are self-contained. I have been 
somewhat more permissive when using techniques from combinatorial 
group theory or three-manifolds. 

The paper ends with a brief appendix (Section A) indicating how 
these techniques extend to closed surface groups. 

Acknowledgements. I thank Martin Bridson and Nathan Broaddus 
for both inspiring seminar talks and for illuminating conversations. 
Without such models this paper would not exist. I also thank Ilya 
Kapovich for his comments on an early version of this paper. 

2. Straight line programs and composition systems 

Recall that if £ is a set of characters, or an alphabet, then the Kleene 
closure C* is the set of all words (finite strings of characters) in C If 
w G C* then \w\ is the length of w: the number of characters of w 
counted with multiplicity. We write e for the empty string: the word 
of length zero. If u and v are words we write u = v if and only if u and 
v are identical as strings. 

Fix a word w and assume < i < j < \w\. We take w[i : j] to be the 
substring starting immediately before the i + 1 th character of w and 
ending just after the j th character. Thus w[i : i] = e and, in general, 
\ w [i '■ j]\ = j — i- By convention negative indices count from the end of 
w. Thus: 

w[— j : —i] = w[\w\ — j : \w\ — i]. 

The following abbreviations will be useful: w[: i] for w[0 : i], w[i :] for 
w [i : \w\], and w[i] for w[i : i + 1]. Thus w[0] is the first character of w 
while w[— 1] is the last. 

If u,v are words in C* then their concatenation is denoted by u • v. 
This leads to the pleasant identity w = w[: i] ■ w[i :]. Also, in order 
to rotate a word w exactly i characters simply form w[i :] ■ w[: i\. For 
any word w G C* define rev(w) to be the reverse of w: so rev(w)[i] = 
w[—i — 1]. 
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A straight line program A = (C,A,A n ,V) contains the following: a 
finite alphabet C = {ai, . . . , a m } of terminal characters, a disjoint finite 
alphabet A = {A±, . . . , A n } of non-terminal characters, a root terminal 
A n G A, and a set V = {Ai — > Wi} of production rules. These last 
allow us to replace a non-terminal Ai with its production: a (possibly 
empty) word Wi in (C U A)*. Every non-terminal Aj appearing in 
Wi has index j < i. When indices are unimportant we shall use the 
non-terminal A to represent the root of A. 

To justify the term "root" define the production tree of a character 
in C U A as follows: The tree for a terminal a e £ is a single vertex, 
labelled a. The tree for a non-terminal Ai G A is planar, has root 
labelled Ai, and, attached to the root in left-to-right order, there is a 
copy of the production tree for every character of Wi. 

If B G £ U A then the height of B, denoted ||JB||, is the height of 
the production tree of B: the maximal distance from the root to a leaf. 
For example, terminals have height zero. 

Define w(A) = wa to be the word of C* that results from running 
the straight line program A. That is, if A n is the root then produce 
W n , replace all non-terminals appearing with their productions, and 
continue doing so until the resulting word lies in C*. This is exactly 
the word appearing at the leaves of the production tree of A. Also, 
define w(Ai) = WAi to be the word in C* produced by the non-terminal 
A. 

A straight line program is in Chomsky normal form if every produc- 
tion Wi has length one or two: all of the former lie in L while all of the 
latter lie in A*. 

Remark 2.1. Every straight line program can be placed in normal 
form in time polynomial in Ej|Wj|. This is done by introducing dummy 
non-terminals. 

Remark 2.2. Two remarks are in order about the meaning of polyno- 
mial time. First, the variable in question is the bit-size of the input. For 
example, a straight line program in normal form, with n non-terminals, 
has bit-size 0(nlog 2 (n)). The exact bit-size depends on a choice of en- 
coding. Second, the precise running-time of an algorithm depends on 
the model of computation. However, it is generally true that changing 
encoding or model of computation transforms the bit-size or running- 
time by a polynomial function. Thus the claim that some problem may 
be solved in polynomial time is essentially independent of these choices. 
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Example 2.3. Here is the canonical example of a straight line pro- 
gram: 

/ {a,b},{Fi},F n , 

\{Fi^ • F i _ 2 }^ 3 U {F 2 ^a,F x ^ b} 

So w(Fi) is the i th Fibonacci word. For example 
w(F 8 ) = abaababaabaababaababa, 
w(Fg) = abaababaabaababaababaabaababaabaab. 

It follows that the length |w(F n )| grows exponentially with n. 

Composition systems are a more flexible version of straight line pro- 
grams (in normal form), introduced in [9]: If A,B,C G A, then pro- 
ductions of the form A — > B[i : j] ■ C[k : I] are allowed. Here B[i : j] 
is a truncated non-terminal. Truncated non-terminals only appear on 
the right hand side of productions, never on the left. A truncated non- 
terminal B[i : j] is well formed if the indices satisfy < i < j < \wb\- 
Define w(B[i : j]) = WB[i '■ j]- Repeated truncation behaves quite 
simply: (B[i : j])[k : I] — B[i + k : i + I). 

Example 2.4. Consider the straight line program in normal form 

{a, 6}, {A { }, A n , 

{Ai -> • Ai_i}^ 4 U {A 3 -> A 2 ■ A x , A 2 ^a,A x ^ b} 

Of course, w{Ai) = (ab) 2 ' 3 for i > 3. Truncate a little to obtain: 

{a,b},{Bi},B n , 
{B^B^l :] ■ :]}L 5 U 
U{B 4 -> 5 3 ■ B 3 , B 3 ^B 2 -B 1 ,B 2 ^ a, B 1 6} 

Now the output appears to be more interesting: for example 

w(B§) = bababbabbbababbabbababbabbbababbab. 

For future use, we record: 

Lemma 2.5. If A is a composition system of height \ \A\\ then \wa\ < 
2 PII. □ 

Lemma 2.6. There is a polynomial-time algorithm that, given a com- 
position system A, computes \wb\ for all B G A. □ 

Lemma 2.7. There is a polynomial-time algorithm that, given a com- 
position system A and an integer i, computes the character wa\i\- D 

Remark 2.8. In particular, suppose we are given i, j with < i < j < 
\wa\- Then we may compute the word w^fi : j] in time polynomial in 
j — % and in the size of A. 
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It is perhaps surprising that the expressive power of composition 
systems and straight line programs are nearly the same. In Hagenah's 
thesis [10, Chapter 8] we find: 

Theorem 7.1. There is a polynomial-time algorithm that, given a com- 
position system A, finds a straight line program X with the property that 
w x = w A . 

The algorithm and the proof of correctness are presented, for the 
convenience of the reader, in Section 7. A more subtle result is due to 
Plandowski [22]: 

Theorem 8.1. There is a polynomial-time algorithm that, given straight 
line programs A and X in normal form, decides whether or not wa = 
w x . 

In an attempt to be self-contained a proof appears in Section 8. 
Gasieniec, Karpinski, Plandowski, and Rytter [9] strengthen Theo- 
rem 8.1 as follows: 

Theorem 2.9. There is a polynomial-time algorithm that, given com- 
position systems A and X, computes the largest integer k > so that 
w A [- k] = w x [- k]. 

Proof. This follows from Theorems 7.1 and 8.1. Compute k via binary 
search. □ 

3. Lohrey's algorithm 

We now turn our attention to the free group. Let C m = {a^a;}™^. 
Let 7 : C m — > C m be the obvious involution. Given a word w(ai) G C* m 
define w to be rev(w(aj)). 

Compressed word is the umbrella term for a straight line program or 
composition system which produces a word in C* m . The involution given 
above extends to compressed words; constructions like B are allowed 
on the right hand side of productions. If A — > B[i : j] ■ C[k : 1} then 
A — >■ C[-l : -k] ■ B[-j : -i}. Finally, if A — > a then A — ► a. See 
Example 3.2 below for an illustration. 

Lemma 3.1. There is a polynomial-time algorithm that, given a com- 
pressed word A, computes a new compressed word A so that w{A) = 
w(A). □ 

Now suppose that {ai, . . . , a m } generate the free group F m and that 
Hi is the inverse of a^. Recall that a word in the free group is freely re- 
duced if it has no subwords of the form aia~i or a,a, for any i e {1, . . . m}. 
A word is cyclically reduced if all of its rotations are freely reduced. 
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Example 3.2. To simplify notation inside this example, let £2 = 
{a, b, a, b}. Form a straight line program 

^ _ / £2, {Ai, Bi}^ =0 , A n , 

\ {Ak+i — > B k , B k+ i — > BkAkBk}^ U {A — > a, £? — > &} 

Thus 

u>(A 5 ) = babbbabbabbabbbabbabbbabbabbbabbabbabbbab. 

Note the close relation with y? 5 (a) where F2 — > i<2 is the automor- 
phism a 1 — ^ 6, 6 1 — > bob. Notice also that w{A^) has both free and cyclic 
reductions. 

Using the generalization of Plandowski's work (given as Theorem 2.9 
above), Lohrey [16] has proven: 

Theorem 3.3. There is a polynomial-time algorithm that, given a 
straight line program A in normal form, finds a composition system 
X with the property that wx is the free reduction of wa- 

Proof Induct on n. Suppose A = (C m , A, A n , V) is the given straight 
line program. Now define a composition system X = (C m , X,X n , Q). 
For every non-terminal A, of height one in A place Xi in X and add 
the production Xi — > w(Ai) to Q. 

Now, if height {A n ) = 1 then there is nothing more to prove. So 
assume that height {A n ) > 2. By induction assume that X iy for i < n, 
lies in X and assume that the corresponding production lies in Q. Thus 
w(Xi) is freely reduced for all i < n. Now place X n in X and consider 
what X n will produce. 

Suppose that A n — > Ai-Aj. Build Xi using the algorithm of Lemma 3.1. 
Apply the algorithm of Theorem 2.9 to find the largest k so that 
w(Xi)[: k] = w(Xj)[: kj. Add the production 

X n ^Xi[: -k}-Xj[k :] 

to Q. The word w{X n ) is now freely reduced. □ 

Example 3.4. We continue Example 3.2. Given A as above Lohrey's 
algorithm produces the following composition system: 

/ £2, {Xi, Yi}, X n , 
x = ( {X l+1 - Y u Y t+l - y;[:_2] • F^u 

\ U{X X - Y , Y, - Y X Y , X ^a,Y ^ b} 

For example, w(X 5 ) = bababab. Deduce that w(X k+2 ) = w(Y k+ i) = 
ba ■ w(Y k ) for k > 1. 

There is an important corollary of Theorem 3.3: 
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Theorem 3.5 (Lohrey [16]). The word problem for compressed words 
in the free group is solvable in polynomial time. □ 

We notice two more consequences. 

Corollary 3.6. There is a polynomial-time algorithm that, given a 
compressed word A, finds a compressed word X with the property that 
Wx is the cyclic reduction of wa- The algorithm also gives the com- 
pressed conjugating word. 

Proof. Following Theorems 7.1 and 3.3 assume that wa is freely re- 
duced. Using the algorithm of Lemma 3.1 produce the compressed 
word A. Now apply the generalization of Plandowski's algorithm (The- 
orem 2.9) to find the largest k so that wa[- k] = w-^[: k]. It follows 
that the composition system A' — > A[: k] produces the conjugating 
word. Also, the composition system X — > A[k : —k] produces the 
cyclic reduction of wa, as promised. □ 

The second consequence is more subtle: 

Theorem 3.7. The conjugacy problem for compressed words in the 
free group is solvable in polynomial time. The algorithm also computes 
the compressed conjugating word. 

We only sketch the proof, as the theorem is not used in the sequel. 

Proof sketch of Theorem 3. 7. Suppose that A and X are the given com- 
pressed words. Using Corollary 3.6 assume that A and X produce cycli- 
cally reduced words. Using Lemma 2.6 check that \wa\ and \wx\ are 
equal. 

Let W be the compressed word with root production W X ■ X. 
That is ww = wx • wx- Thus, to prove that wa and wx are conjugate 
it suffices to prove that wa appears as a subword of ww But this is 
exactly a special case of the fully compressed pattern matching problem 
which can be solved in polynomial time. See, for example, the work 
of Karpinski, Rytter, and Shinohara [14], of Gasieniec etal [9], or of 
Miyazaki, Shinohara, and Takeda [20]. □ 

4. Free-by-cyclic groups 

For group theory background the reader should consult Lyndon and 
Schupp's book [18]. Recall the definition of Aut(F m ): the group of all 
automorphisms of the free group F m . Fix $ e Aut(F m ). The free-by- 
cyclic group Gq> is presented by: 

(ai,t | tait = $(oi),i e {1, . . . ,m}). 

The goal of this section is to prove: 
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Theorem 4.1. The word problem for G$ is polynomial time. 

This problem is already known to be in NP: Bridson and Groves [5] 
show that G$> has a quadratic isoperimetric inequality. 

Proof of Theorem J^.l. Let C m = {a>i,ai} and M. = C m U {t,t}. Let 
A = {A i)P | % E {1, . . . m},p e N}. Fix $ e Aut(F m ) by assuming that 
the words ifj(ai, . . . , a m ) = $(a,) are given as input. 
Define production rules as follows: 

Aifl — > a i 

Ai iP — > Ui(Ai jP —i, . . . , A m! p-i), p>l 

Suppose now that W is a word in M* . The length of W determines 
the size of the given word problem. Now rewrite W in stages: First 
freely reduce. Next replace every and Oj appearing by A it0 and by 
A i , respectively. Now move all occurrences of t to the right, and of t 
to the left, rewriting as follows: 

t ■ A itP A itP+ i ■ t 

t ■ Ai p ~» ^4j,p+i ■ t 

Ai.p -t ~> t • Ai p+i 

Ai^p -t ~> t • ^4j,p+i 

t -t ^ € 

The result is a word in { A,p, ^i,p}* ; possibly with powers t fc and 
t appearing at the end and beginning. Let W'(Ai tP ) be this word, 
omitting the leading and trailing powers of t. Construct a straight line 
program with a root non-terminal A — > W'(Ai tP ). Notice that W is 
trivial in G<$> if and only if the word wa freely reduces in F m and the 
powers satisfy k = I. (This is a simple form of Britton's Lemma. See 
page 181 of [18].) 

However the latter occurs if and only if the total exponent of t in W 
is zero. The former is exactly solved by applying Lohrey's algorithm 
(Theorem 3.3). □ 

Remark 4.2. The statement of Theorem 4.1 may be generalized to 
ascending HNN extensions: $ is assumed to be an injection instead of 
an automorphism. The proof is identical. The same question for HNN 
extensions in general seems to be more delicate. 
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5. The automorphism group of a free group 

We now examine the automorphism group in greater detail. Recall 
that the automorphism group Aut(F m ) is finitely generated by the 
Nielsen generators (see Chapter 1.4 of [18]): 

(1) cti G Aut(F m ) so that ai\C m interchanges and cij, fixing all 
other elements of C m . 

(2) G Aut(F m ), with i ^ j, has /3y(a;) = a^, and /%(a fc ) = a k 
for all k 7^ i. 

Remark 5.1. Choosing a different generating set alters running times 
by at most a multiplicative constant. The choice above simplifies the 
proof below. 

Theorem 5.2. The word problem for Aut(F m ) is polynomial time. 

This solves problem (CI) on the list maintained by Baumslag, Myas- 
nikov, and Shpilrain [2]. 

Proof of Theorem 5.2. Suppose that $ = tpi . . . <p n is a word in the 
Nielsen generators of Aut(F m ). We must check that $(aj) freely reduces 
to aj, for all i. 

To do this, define a straight line program: Let {A,p} be the set 
of non-terminals with i G {1, . . . m} and p G {0, 1, . . . n}. Create the 
following production rules: 

Aifi — > di 

A, P — >■ (p p (Ai, P -i), V > 1 

If ip p = oti is of the first kind then ip p (A itP ) equals A itP _i while ip p (Aj tP ) 
equals Aj tP -i, for j ^ i. If <p p = is of the second kind then ip p (A itP ) 
equals • and so on. 

Now apply the algorithm of Theorem 3.3 to rewrite this straight 
line program so that all outputs are freely reduced. If the resulting 
composition system has |w(Aj n )| > 2 for any i then the automorphism 
$ is nontrivial. If |ty(Aj jn )| = 1 for all % then, using the algorithm of 
Lemma 2.7, check if w(A itn ) = a^. If this is the case for all % then $ is 
the identity element of Aut(F m ). □ 

Remark 5.3. In our analysis of the word problem for free-by-cyclic 
groups we could accept as input both the word W in A4* and auto- 
morphism $ given as a word in the Nielsen generators. Now we need 
not precompute the words Wj. Instead we find, as in the proof of The- 
orem 5.2, straight line programs producing these words. The running 
time of Theorem 4.1 then becomes polynomial in the two inputs W 
and $. 
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Note that solving the word problem for a group also solves the word 
problem for subgroups. Of course, there are many beautiful subgroups 
of Aut(F m ). As just a single example consider the braid group, B m : 
Let D m be a disk with m points removed from the interior. Then B m is 
the group of homeomorphisms of D m which fix the boundary pointwise, 
modulo boundary and puncture fixing isotopies. Choosing a basepoint 
on the boundary makes B n act on iri(D m )=F m and so embeds B n into 
Aut(F m ). A simple corollary to Theorem 5.2 is the well-known: 

Corollary 5.4 ([3], [15], [8], [4], [11]). The word problem for B m is 
polynomial time. □ 

6. Membership problems 

We now turn our attention to membership problems, also called gen- 
eralized word problems. Suppose that if is a subgroup of a finitely 
presented group G. We seek an algorithm that, given a word W writ- 
ten in the generators of G, decides if W represents an element of H. 
Note that if H is normal in G then such an algorithm also solves the 
word problem in the quotient G/H. 

Recall that Inn(F m ) is the normal subgroup of Aut(F m ) which con- 
tains for all words U E C* m . <&u(V) = UVU. The quotient is the 
outer automorphism group, Out(F m ). 

Theorem 6.1. The word problem for Out(F m ) is polynomial time. 

Proof. To see this, note that the membership problem for Inn(F m ), 
inside of Aut(F n ), is solved by the construction given in the proof of 
Theorem 5.2 and by Corollary 3.6. □ 

Another interesting membership problem is that of the braid group 
(or more generally, mapping class groups of punctured surfaces) inside 
of Aut(F m ). In order to avoid multiplying examples we only discuss 
the problem for the braid group: it suffices to check that the boundary 
word is fixed and all punctures are preserved, up to conjugacy. (See [19, 
Theorem N6], for example.) Corollary 3.6 can check the latter and the 
former is dealt with by Lohrey's Algorithm (Theorem 3.3). 

Here is another kind of membership problem, first proposed by Nathan 
Broaddus. Let M.CQ{S g ) denote the mapping class group of the closed 
connected orientable genus g surface, S g . Consider a handlebody V g so 
that dV g = S g . Then MCQ{V) naturally includes in MCg(S). 

Remark 6.2. Recall the fundamental fact that $ G AiCQ(S) lies in 
the subgroup M.CQ{V) if and only if $ preserves the set of meridians: 
the set of curves in S which bound disks in V. In fact a "weaker" 
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condition is equivalent: let © be a collection of g disjoint disks in V so 
that VxD is a three-ball. Then $ £ M.CQ{V) if and only if the curves 
$((9D) bound disks in V. 

Fix a point x £ 5 and let {ai, . . . , a g , b±, . . . , b g } be the standard set 



of generators of tti(S, x) 
meridians. See Figure 1. 



We arrange matters so that all of the bi are 




Figure 1. Generators for the fundamental group of S. 

Choose also a standard set {r} of Dehn twist generators for AiCQ(S). 
In fact we will over-specify these twists: for each Dehn twist in the 
generating set choose a twisting curve {a} which avoids a small neigh- 
borhood of the basepoint x £ S. See Figure 2. 

Pick finally a point y in this small neighborhood of x, avoiding all 
the loops Chi and bi. It follows that 7Ti(5\{?/}, x)=F 2g is freely gen- 
erated by the aj and 6j loops. Also, the Dehn twists give free group 
automorphisms. (See [19, Theorem N10].) Broaddus tells us the useful: 

Lemma 6.3. Fix W £ {eij, bi, aj, bi}* so that W is homotopic in S to 
a simple closed curve u. The following two conditions are equivalent: 

• uo is a meridian. 

• After deleting all bi 's and bi 's from W the resulting word in 
{ai,ai}* freely reduces to the empty word. 

Proof. Consider the map ni(S, x) — > 7Ti(V, x)=F g induced by inclusion. 
As in Figure 1 all of the bi lie in the kernel while the images of the aj 
freely generate 7Ti(V). Accordingly, identify 7Ti(V) and the free group 

W). 
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Figure 2. Generators for the mapping class group. 

Suppose now that the first condition holds: ui is a meridian. Then W 
lies in the kernel. Since the 6j normally generate the kernel the second 
condition follows. 

Suppose instead that the second condition holds. Since W is in the 
kernel, deduce that to bounds a singular disk in V. By the celebrated 
Loop Theorem [12] u is a meridian. □ 

We are now prepared to prove: 

Theorem 6.4. The membership problem for M.CQ{V) in MCQ(S) is 
polynomial time. 

Proof. As is well-known [13] the 2g + 1 Dehn twists shown in Fig- 
ure 2 generate the mapping class group, M.CQ{S). So fix a word 
$ = 7~i . . . T n , written in terms of these twists and their inverses. By 
Remark 6.2 it is enough to check that thought of as a word in 

the free group generated by the otj and the bi, satisfies the second con- 
dition of Lemma 6.3. This could be done directly, but might have 
length exponential in n. 

Instead, for each i encode as a compressed word, say A(z). It 

is a triviality to remove all bi and bi appearing in w(A(i)): for every 
non-terminal B with B — > bi replace the production by B — > e. Call 
the new compressed word A'(i). 

Now run Lohrey's algorithm (Theorem 3.3) on A'(i). The mapping 
class $ lies in M.CQ(V) if and only if w(A'(i)) freely reduces to the 
empty string, for all i. □ 
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Remark 6.5. Suppose that W is another handlebody of genus g with 
dW = S so that V Us W is the three-sphere. Here the a, loops 
bound disks in W. Recall that the Heegaard group 7i = M.CQ{V) H 
M.CQ{W) is the subgroup of MCQ(S) consisting of all mapping classes 
which extends over both V and W. Applying Theorem 6.4 twice 
gives a polynomial-time algorithm for the membership problem of 7i 
in MCG(S). 

7. HAGENAH'S ALGORITHM 

We now discuss the computer science underpinnings of the discussion 
above. To begin, in Hagenah's thesis [10, Chapter 8] we find: 

Theorem 7.1. There is a polynomial-time algorithm that, given a com- 
position system A, finds a straight line program X with the property that 
w x = w A . 

The exposition of this result in [10] is wonderfully clear. I present a 
proof only to make this paper self-contained. 

Proof of Theorem 7.1. Fixing notation, suppose that A = (£., A, A, V). 
Note that all productions in V are either of the form B — > C[i : j] -D[k : 
I] or of the form B — > a[i : j]. 

Build the straight line program X = (£, X,X, Q) from the bottom 
up. The set X will contain plain non-terminals, one for each non- 
terminal of A, and decorated non-terminals, each associated to some 
plain non-terminal. Proceed as follows: for every non-terminal B G A 
of height one we add a plain non-terminal Y to X. Suppose B produces 
a[i : j]. Then 

• if j — i + 1 add Y — > a to Q and 

• if j — i add Y — > e to Q. 

Let A be the root non-terminal of A. Assume via induction that 
for every other non-terminal B G A a plain non-terminal Y has been 
added to X, so that wy = wb- We now describe the decorated non- 
terminals that may also, by induction, appear in X. Fix any plain 
non-terminal Y in X. Then yto'l is a decorated non-terminal. There 
are various cases: 

• If < i < j < \wy\ then Y^ is a subword non-terminal. 

• If < j < \wy \ then Y^ is a prefix non-terminal. 

• If < i < \wy \ then Y^ is a suffix non-terminal. 

• = y is the plain non-terminal. 

• y'"l = e is the empty word. 
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Repeated decoration behaves as expected: (Y^'-riy 1 '-^ = Y^ l+k '- l+l \ The 
production rules for decorated non-terminals are given below. 

Suppose now that the root has production A — > B[i : j] ■ C[k : I]. 
Suppose that Y and Z are plain non-terminals in X corresponding 
to the non-terminals B and C. Add a plain non-terminal X to X 
corresponding to A. Add the non- terminals and to X. Add 
the production rule X —> Y^ ■ Z^ to Q. 

Production rules are needed for every new decorated non-terminal, 
Y^\ created by the addition of the plain X. Suppose that the plain 
non-terminal Y produces U ■ V . (Here it is possible that U and V 
are themselves decorated non-terminals.) There are several cases and 
subcases: Suppose first that Y^ t: ^ is a subword non-terminal. 

Subcase Add to X Add to Q 

\wjj\ < i y[i-\wu\-j-\wu\l yWI _> y[i-\™u\--j-\wu\\ 

i < \wu\ < j U^'\ V^-\ w u\] — > ■ : -H™t/O 

j <\ Wu \ Y^ -> 

Suppose now that Y^ is a prefix non-terminal. 

Subcase Add to X Add to Q 

\ Wu \ < j V [:j ~ lwuli Y^ — >■ U ■ V [:j ~ lwu U 
j < \w v \ UM YM -> UM 

Suppose now that Y^ is a suffix non-terminal. 

Subcase Add to X Add to Q 

i < l^al Y^ -> f/M • 1/ 

Notice that creating the plain non-terminal A causes at most two 
decorated non-terminals to be created, both of lesser height. Every sub- 
word non-terminal in turn creates at most one subword non-terminal 
or at most one prefix and at most one suffix non-terminal. Again, these 
have lesser height. Finally, any prefix (suffix) non-terminal causes at 
most one prefix (suffix) non-terminal to be created. As usual, the 
height decreases. 

Suppose that n — \\A\\. It follows that the creation of the plain 
non-terminal X adds at most 1 + 2(2n) = 1 + An new decorated non- 
terminals to X. Thus the total number of non-terminals in X, at the 
end of the construction, is 

n 

n+ V(l + 4||5||) < n + ^(1 + Ai) = 2n 2 + An. 

BeA i=l 
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This completes both the description of the algorithm and its proof of 
correctness. □ 

8. Plandowski's Algorithm 
The final piece of the puzzle is: 

Theorem 8.1 (Plandowski [22]). There is a polynomial-time algorithm 
that, given straight line programs A and X in normal form, decides 
whether or not wa = Wx- 

A proof, essentially following [22], is provided for the convenience of 
the reader. 

Proof of Theorem 8.1. Let A = (£,A,A,V) and X = (£,X,X, Q). 
Note that we assume, as we may, that A and X have the same terminal 
alphabet. Making a copy of X if necessary, assume that A H X = 0. 
Finally assume that \wa\ = \wx\ and \A\ = m > n = \X\. 

We begin with the following definition: a triple (B, Y, i) is an asser- 
tion if: 

• B e AU £ zndY e X U £. 

• < i < \wb\- 

If < i and \wb\ < i + \wy\ then (B,Y,i) is a overlap assertion. If 
< i and i + \wy\ < \wb\, an d then (B,Y,i) is a subword assertion. 
Assertions of the form (Y, B, i), are defined similarly. We do not allow 
a pair of non-terminals from the same program to appear in a single 
assertion. 

An overlap assertion (B,Y,i) is satisfied if and only if ws[i '■] = 
Wy[. \wb\ — i\- Likewise, a subword assertion is satisfied if and only if 
wsii '■ i + \wy\] = wy- As a bit of terminology a set of assertions, T, 
is satisfied if and only if every assertion 7 e T is. In point of fact, the 
algorithm checks satisfiability of (B, Y, i) when and only when both B 
and Y are terminal characters and i — 0. 

In general a set of assertions r& is transformed into another such set, 
Tk+i- Beginning with r = {(A, X, 0)} the following properties will be 
maintained: 

(a) Tfc+i is satisfied if and only if is satisfied. 

(b) At most m + n — k elements of A U X are mentioned in T^. 

(c) For all k, \Fj.\ is bounded by (k + l)4mn(m + n). 

There are two ways to produce a new assertions from old, splitting and 
compacting. 
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Splitting. Fix T, a set of assertions. Suppose that, of all non-terminals 
from A and X appearing in T, the non-terminal B G A has maximal 
length. Fix 7 G V. We must define Split(7,S) and then Split(r,S). 

There are several cases to consider. If B does not appear in 7 then 
Split(7,£>) = {7}. Now suppose that 7 = (B,Y, i) or 7 = (Y, B,i). 
Note that 7 is either an overlap or subword assertion and that we 
have assumed \wb\ > \wy\. Suppose that B —> C ■ D. Now consider 
subcases. If 7 = (B,Y,i) is an overlap assertion then: 

Subcase Split (7, .B) Type 

i< \wc\ (C,Y,i) overlap 

(y, D, \w c \ — i) either 

\wc\ < i (D,Y,i — \w c \) overlap 

The table should be read as follows: When % < \wq\ then Split (7, B) 
contains two assertions: either a pair of overlaps or one of each type. 
When \wc\ < i the set Split (7, 5) contains a single overlap assertion. 
Suppose now that 7 = (B, Y, i) is a subword assertion: 

Subcase Split (7, B) Type 

i + \wy \ < \wc\ (C,Y,i) subword 

i+ \wy \ = \wc\ (C,Y,i) overlap 

i < \wc\ <i + \wy\ (C,Y,i) overlap 

(Y, D, \ w c \ — i) overlap 

i — \ w c\ {Y,D,0) overlap 

\wc\ < i (D,Y,i — \wc\) subword 

Suppose now that 7 = (Y, B, i) is an overlap assertion. As usual assume 
that \wb\ > \wy\- 



Subcase 


Split^B) 


Type 


\wy \ < i + \w c \ 


(Y,C,i) 


overlap 


i + \wc\ < \wy\ 


(Y,C,i) 


subword 




(X,D,i + \w B \) 


overlap 



Finally (Y,B,i) cannot be a subword assertion because \wb\ > \wy\. 
This finishes the definition of Split (7, B). Define 

Split(I\5) = U 7e rSplit( 7 , J B). 

Immediate from the definitions is: 

Claim 8.2. A set of assertions T is satisfied if and only if Split(r,S) 
is satisfied. □ 

Define now o(T) to be the number of overlap assertions in T. Sim- 
ilarly we take s(T) to be the number of subword assertions of T. So 
|T| = o(r) + s(r). From the tables above deduce: 
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Claim 8.3. Suppose that T is a set of assertions. Then 

(Spiit(r, p)) < o(r) + 2s(r) 
s(Spiit(r, p)) < o(r) + s (r) 

when P G A U X is a non-terminal of maximal length in T. □ 

Compact. Now for the definition of Compact (r). Note that, if u is a 
word, then p G N is a period of u if 

• 1 < p < \u\ — 1 and 

• = u [i + p] for all < i < \u\ — 1 — p. 

An immediate consequence of the definition is: 

Claim 8.4. Suppose that 7 = (B,Y,i) and 7' = (B,Y,j) are overlap 
assertions with i < j. Then 7 and 7' are satisfied if and only if 7 is 
satisfied and j — i is a period of the word %[i :]. □ 

We now give a restricted version of the famous Periodicity Lemma [17]: 

Lemma 8.5. If p and q are periods of u, where p + q < \u\, then 
gcd(p,q) is also a period of u. □ 

The following claim is the engine in the proof of correctness of 
Plandowski's Algorithm: 

Claim 8.6. Suppose that 7 = (P,F, i), 7' = (P,y,j), and 7" = 
(B, Y, k) are overlap assertions with i < j < k and j — i + k — i < 
\wb\ — i- Then 7, 7', and 7" are satisfied if and only if 7 is satisfied 
and gcd(j —i,k — i) is a period of WB[i ']■ 

Proof. This follows from two applications of Claim 8.4 and from the 
Periodicity Lemma 8.5. □ 

Equivalently 7, 7' and 7" are satisfied if and only if 7 and 5 = 
(B,Y,i + gcd(j — i,k — i)) are satisfied. This leads directly to the 
definition of SimpleCompact: given {7, 7', 7"} as in the hypothesis of 
Claim 8.6 define SimpleCompact ({7, 7', 7"}) = {7, 5}, with 5 as above. 

Now, for any set of assertions T define Compact (T) to be the result 
of applying SimpleCompact to all triples of overlap assertions which 
follow the requirements of Claim 8.6. Every successful application of 
SimpleCompact removes an assertion from T. Thus a single Compact 
operation involves calling SimpleCompact at most 0(|r| 4 ) times. (This 
can be greatly improved upon, if so desired.) We also record the fact: 

Claim 8.7. T is satisfied if and only if Compact (T) is satisfied. □ 
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Now to define T^+i in terms of IV Suppose that P G A U X is 
a non-terminal appearing in maximizing the length of \wp\. Then 
take 

Tfc+i = Compact(Split(Tfc, P)). 

Note that property (a) above is guaranteed by Claims 8.2 and 8.7 while 
property (b) is provided by the fact that every non-terminal is split for 
at most one value of k. We must now bound the size of IV 

Fix attention on any pair of non-terminals B G A and Y G X. Let 
{(B, Y, ij)}^ = i be the overlap assertions of which mention B and Y 
in that order and indexed so that ij < ij+i- As IV is compact it follows 
from Claim 8.6 that 

\wb\ — ij < ij+2 — ij + ij+i — ij- 
Since ij + i < z,- + 2 it follows that 

\{\^b\ - ij) < ij+2 - ij. 

Deduce that iV < 21og 2 (|ws|) + 1 < 2||JB|| + 1, with the last inequality 
following from Lemma 2.5. 

Claim 8.8. With r fc+1 as given: 

o(Tk+i) < mn(2m + 1) + nm(2n + 1) < 4mn(m + n) 
s(Tk+i) < Amn(m + n) + s(Tk) < k(4mn(m + n)). 
This verifies property (c) above. □ 

This completes both the description of the algorithm and its proof 
of correctness. □ 

Appendix A. On surfaces 

The discussion above gives a satisfactory picture of the behavior of 
compressed words in the free group. One immediately asks for a similar 
treatment of hyperbolic groups in general. However the situation there 
appears to require a new idea. 

Instead we briefly describe well-tempered paths: a beautiful geodesic 
language for closed surface groups. Our discussion is meant to be more 
inspiring than exhaustive: many details are omitted. For simplicity, 
we restrict ourselves to S = S2 the closed, orientable, connected genus 
two surface. 

Let D be the regular decagon in the hyperbolic plane with angles 
27r/5. Label the boundary of D with the word abcdeabcde, read counter- 
clockwise. The first five edges are oriented counter-clockwise while the 
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last five are oriented clockwise. Let L = {a,b,c,d,e,a,b,c,d,e}. See 
Figure 3. 




Figure 3. A labelled regular hyperbolic decagon. 

The decagon and its labelling extends to a tiling T> of the hyperbolic 
plane. Notice that a path in the one-skeleton determines a word in £,*. 
Let M. C C* be the subset that can be realized in this way. Conversely, 
a word of A4 determines a unique path in the tiling, up to the action 
of iri(S). When it cannot cause confusion we treat paths and words 
interchangeably. 

The following words of length five are called bad turns: 

edcba, baedc, ctebae, aedcb, cbaed, 

abcde, cdeab, eabcd, bcdea, deabc. 
A path is well-tempered if the corresponding word is freely reduced and 
contains no bad turn. The intent, and hence the name, is that these 
paths want to "turn right" as often as they "turn left." This is possible 
because 10/2 is odd. See Figure 4 for a picture of the good turns. 
Notice that well-tempered paths are well-behaved: 

Theorem A.l. For any ordered pair of vertices in the tiling D there is 
a unique well-tempered path connecting one to the other. Well-tempered 
paths are geodesic. Subpaths are again well-tempered as are inverses. 
Finally, well-tempered paths are locally detectable: to verify the prop- 
erty it suffices to check all subpaths of length five. 

Remark A. 2. Well-tempered words are similar to short-lex words in 
hyperbolic groups (see [8]). For example, both form regular languages 
which satisfy uniqueness and which are closed under taking subwords. 
Short-lex is generally not closed under taking inverses and is essentially 
never locally detectable. 
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Figure 4. All of the good turns are shown. 

Here is a sketch of the proof of Theorem A.l. Suppose that a and 
(3 are paths and the final vertex of a is the initial vertex of (3. Define 
a ■ (3 to be their concatenation. If a and (3 are well-tempered and a ■ (3 
is not then the free reduction (or bad turns) must overlap the point of 
concatenation. 

We may straighten a- (3 (rel endpoints) until it becomes well-tempered. 
There are four stages: 

(1) Free reduction. 

(2) Sweeping across two sides of at most one corridor. 

(3) Sweeping across at most three pieces. 

(4) Sweeping across one side of at most two corridors. 

Before elaborating on these we briefly give definitions: A piece is a 
path 5 of length two to ten where all edges of 5 are on the boundary of 
a single decagon of the tiling. Now, fix a decagon D = Dq and suppose 
that Di is the image of Dq under the i th power of a fixed side pairing 
transformation of Dq. The union C = U^Tq Dj is called a corridor. The 
two edges in dC corresponding to the transformation are the ends of 
C . The other two components of dC are the sides of C. Note that 
sides of corridors always have period four. See Figure 5. 

Suppose again that a and (3 are well-tempered and 70 = a ■ (3 is 
the concatenation. We can now flesh out the stages required to make 
7o well-tempered, assuming it is not already. First freely reduce, if 
possible, to produce 71. If 71 contains two sides and one end of a 
corridor C then sweep 71 across C to obtain 72. This deals with all 
pieces of length nine and ten. 
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Figure 5. An e-corridor of length three. 



Next sweep 72 across at most three pieces of lengths between five 
and eight to form 73. The pieces of length five are necessarily bad 
turns. The proof that there are at most three such is a lengthy but 
straight-forward combinatorial argument. 

If 73 contains an end and a side of a corridor C, forming a bad turn, 
then sweep 73 across C. This occurs at most twice. Call the resulting 
curve, which must be well-tempered, 7. These four stages simply move 
a ■ (3 through the thin triangle bounded by a, /3, and 7. See Figure 6. 



a 




FIGURE 6. A cartoon of a thin triangle in the decagon tiling. 

Recall that M. is the set of words corresponding to paths in the one- 
skeleton of the tiling D. For any compressed word A in M. let 7^ be 
the corresponding path. Again 7,4 is only defined up to the action of 
tti(S). The above discussion leads both to a proof of Theorem A.l and 
of: 

Theorem A. 3. There is a polynomial-time algorithm that, given a 
compressed word A in M. , finds a compressed word X where 

• the path 7^ is well-tempered and 

• 7a and 7x are homotopic rel endpoints. 
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Proof sketch. Suppose that the root of A has production A — > B-C . By 
induction assume that 7b and 7c are well-tempered. The first stage is a 
straight-forward application of Plandowski's Algorithm (Theorem 2.9). 
The second and fourth stages require both Plandowski's Algorithm and 
the fact that words of period four are highly compressible. To deal 
with the third simply examine a constant sized suffix of w(B") and a 
constant sized prefix of w(C"), where B" and C" are the compressed 
words output by the second stage. □ 

Remark A. 4. There is a subtlety hidden in this proof sketch - the 
compressed word X produced may have bit-size larger than that of A. 
Since the proof is inductive the growth must be carefully controlled, in 
part using Hagenah's Algorithm (Theorem 7.1) 

Remark A. 5. It may be possible to prove versions of Theorems A.l 
and A. 3 using short-lex paths. However the number of combinatorial 
possibilities appears to greatly increase. Also, I do not know how to 
control the growth in size indicated in Remark A. 4 when using short- 
lex paths. If this could be done then the entire discussion should apply 
to general word hyperbolic groups. 

From Theorem A. 3 deduce: 

Corollary A. 6. The word problem for compressed words in tti(S) is 
solvable in polynomial time. This gives a solution to the word problem 
in Aut(7Ti(5')). □ 

The compressed conjugacy problem follows from a careful reading of 
Epstein and Holt's paper [7]. In a few places their subroutines, acting 
on words, must be altered to act on compressed words. In particular 
a solution to the fully compressed matching problem (see the proof of 
Theorem 3.7) replaces the Knuth-Morris-Pratt algorithm for checking 
if two words are cyclic conjugates and for computing roots. Also, as 
noted above, the language of well-tempered paths is regular; this is 
used in their proof to find certain bounds. Thus: 

Theorem A. 7. The conjugacy problem for compressed words in vri(S') 
is solvable in polynomial time. □ 

Since vr 1 (S') has no torsion the simple version of Bridson and Howie's 
algorithm [6], adapted to compressed words, now solves the member- 
ship problem for the inner automorphism group Inn(7Ti(S')). Finally, 
recall Nielsen's Theorem (see [19, page 175]): the mapping class group 
M.CQ{S) is isomorphic to the outer automorphism group of ir\(S). So, 
similar to the proof of Theorem 6.1, well-tempered paths give: 
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Theorem A. 8 ([21], [11]). The word problem in AiCQ(S) is solvable 
in polynomial time. □ 
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